Appl. No. 09/844,730 
Amdt dated April 27, 2005 



Amendments to the Specification: 

Please amend paragraph 94 to correct the symbols as follows: 

[0094] [[=]] Mx) = 1 if x is closest to m k , otherwise [[-]] 5k(x) = 0 (resolve ties 
arbitrarily). The summation of these functions over a data set (see (3) and (4)) residing on 
the 7 th unit gives the count, n kh first moment, [[=]] 2*,/, and the second moment, s^u of the 
clusters. The vector {n^ [[=]] ^ s^i | A=l, . • . ,K} 9 has dimensionality 2-AT+^dim, 
which is the size of the SS that have to be communicated between the Integrator and each 
computing unit. 

Please amend paragraph 95 to correct the symbols as follows: 

[0095] The set of SS presented here is more than sufficient for the simple version of K- 
Means algorithm. The aggregated quantity, [[=]] Em s^, could be sent instead of the 
individual But there are other variations of K-Means performance functions that 
require individual Sk,i, for evaluating the performance functions. Besides, the quantities 
that dominate the communication cost are [[=]] I^/,. 

Please amend paragraph 96 to correct the symbols and the equation. In 
the equation, please note the distinction between the summation symbol and the 
subscripted variable E. 

[0096] The I th computing unit collects the SS, {n u , [[=]] 2* /jf | A=l, . . . ,K}, on the 
data in its own memory, and then sends them to the Integrator. The Integrator simply adds 
up the SS from each unit to get the global SS, 
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Please amend paragraph 97 to correct the symbols as follows: 



[0097] The leading cost of integration is 0(^-dim-Z), where L is the number of computing 
units. The new location of the k th center is given by rrik= [[=]] X^/«jt from the global SS 
(this is the /( ) function in (2)), which is the only information all the computing units need 
to start the next iteration. The performance function is calculated by (proof by direct 
verification), 

L 

Pe rfKM = • 
1=1 



Please amend paragraph 102 to correct the equation. Note that the 
numerator in the first summation has changed from "1" to "x". 



[0102] (K-Means is similar, except its weights are the nearest-center membership 
functions, making its centers centroids of the cluster.) Overall then, the recursion equation 
is given by 

1 / _ 1 
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Please amend paragraph 103 to correct the equation. Note that the index 
for the summation is "k" rather than "A", and the denominators in the expression 
for g 2 and g 3 are cubed " 3 " rather than taken to the S power " s ". 



[0103] where d Xi k=\\x-mk\\ and s is a constant ^4. The decomposed functions for calculating 
SS (see (3) and (4)) are then 
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Please amend paragraph 108 to correct the equation. Note that the 
subscript for p is "k" rather than "A", and the symbol for summation must be 
carefully distinguished from the subscripted variable I. 



[0108] In this example, the EM algorithm with linear mixing of K bell-shape (Gaussian) 
functions is described. Unlike AT-Means and ^-Harmonic Means in which only the centers 
are to be estimated, the EM algorithm estimates the centers, the co-variance matrices, 2*, 
and the mixing probabilities, p{mu). The performance function of the EM algorithm is 
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Please amend paragraph 111 to correct the equation. Note that the 
symbol for summation must be carefully distinguished from the subscripted 
variable I*. 

[0111] where p(x\m) is the prior probability with Gaussian distribution, and p(rrtk) is the 
mixing probability. 

P(x\m k )= , 1 =-EXp(-(x-m k )Z(x-m k ) T ^ 
/(2/r) D det(]p t * 7 

p(x\m k ) = ^= 1 -EXp(-(x-m k )i:~ l (x-m k ) T ) 
V(2/r) D det(E*) 



Please amend paragraph 112 to correct the equation. Note that the 
subscript for m is "k" rather than "A", and the symbol for summation must be 
carefully distinguished from the subscripted variable Z. 

[0112] M-Step: With the fuzzy membership function from the E-Step, find the new center 
locations, new co-variance matrices, and new mixing probabilities that maximize the 
performance function. 
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Please amend paragraph 113 to correct the equation. Note that in the 
expression for f 1( the subscript for m is "k" rather than "A". 



[0113] The functions for calculating the SS are: 
/ 1 ( , v,M,S,/0--log 




g 2 {x,M^P) = {p(™\ I x)x,p{m 2 | x)x,...,p{m K \ x)x) 
g 3 (x 9 M,X,p) = (p(mi \ x)x T x,p(m 2 \x)x T x^p{m K \ x)x T x) 
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